Genome size, genetic diversity, and phenotypic variability imply the effect of genetic variation instead of ploidy on trait plasticity in the cross-pollinated tree species of mulberry

Elucidation of genome size (GS), genetic and phenotypic variation is the fundamental aspect of crop improvement programs. Mulberry is a cross-pollinated, highly heterozygous tree eudicot, and comprised of huge ploidy variation with great adaptability across the world. However, because of inadequate information on GS, ploidy-associated traits, as well as the correlation between genetic and phenotypic variation hinder the further improvement of mulberry. In this present research, a core set of 157 germplasm accessions belonging to eight accepted species of Morus including promising functional varieties were chosen to represent the genetic spectrum from the whole germplasm collection. To estimate the GS, accessions were subjected to flow cytometry (FCM) analysis and the result suggested that four different ploidies (2n = 2x, 3x, 4x, and 6x) with GS ranging from 0.72±0.005pg (S-30) to 2.89±0.015pg (M. serrata), accounting~4.01 fold difference. The predicted polyploidy was further confirmed with metaphase chromosome count. In addition, the genetic variation was estimated by selecting a representative morphologically, diverse population of 82 accessions comprised of all ploidy variations using simple sequence repeats (SSR). The estimated average Polymorphism Information Content (PIC) and expected heterozygosity showed high levels of genetic diversity. Additionally, three populations were identified by the model-based population structure (k = 3) with a moderate level of correlation between the populations and different species of mulberry, which imply the effect of genetic variation instead of ploidy on trait plasticity that could be a consequence of the high level of heterozygosity imposed by natural cross-pollination. Further, the correlation between ploidies, especially diploid and triploid with selected phenotypic traits was identified, however, consistency could not be defined with higher ploidy levels (>3x). Moreover, incite gained here can serve as a platform for future omics approaches to the improvement of mulberry traits.

Introduction Mulberry (Morus spp.) has been commercially exploited as the host of monophagous pest silkworm, Bombyx mori L. [1,2]. It belongs to the Moraceae family comprised of 37 genera with more than 1,100 species [3]. The genus Morus has over 10 species with more than 1,000 cultivated varieties spanning Asia, Europe, Africa, and the United States [4,5]. Efforts were made to classify Morus spp., however, to date, taxonomic nomenclature remains doubtful [6]. Besides that, genetics of inheritance are also complicated in Morus spp. As an account of the occurrence of a higher level of heterozygosity as well as polyploidy (or whole-genome duplication, WGD), one of the most important evolutionary processes in higher plants [7,8].
Cytology studies in mulberry are more complicated because of extreme variations in chromosome size and number [9]. However, since 1920, various researchers attempt to understand meiotic behaviour, because it's considered a very important aspect of breeding programmes. Mulberry has a wide range of ploidy ranging from haploid (n = x = 14) to docosaploid (2n = 22x = 308) [4, 10, 11], for example, M. notabilis was reported as "haploid" in nature (2n = 2x = 14,  [9]. The majority of triploids (2n = 3x = 42) and tetraploids (2n = 4x = 56) have been identified in M. laevigata [12]. Hexaploid species (2n = 6x = 84), such as M. serrata [13] and M. tiliaefolia [14] are also recognized, and earlier reports also suggest that polyploidy can extend up to docosaploid (2n = 22x = 308) as in M. nigra [10,11]. The economically important species of mulberry available in India are M. alba, M. indica, M. atropurpurea, M. nigra, M. serrata, M. latifolia, and M. laevigata [13,15]. Moreover, metaphase chromosome count, detecting the ploidy status, size, and nature of chromosomes and behaviour in pairing and segregation during meiosis has been restricted toa few popular mulberry varieties and polyploid species only like M. serrata, M. tiliaefolia and M. nigra [10, [16][17][18][19][20][21][22][23]. Despite our increasing recognition of the chromosomal number or ploidy level, we are still unaware of the biological effects of ploidy in mulberry. Application of FCM and GS estimations of a large collection of mulberry are missing, which is considered to be a preciseway to generate useful information for the analysis of the genetic relationship, phenotypic diversity, estimation of ploidy-associated traits, and other fundamental aspects of mulberry genomes.
In the last few decades, genetic relationship analysis of mulberry is an important component of crop improvement programs and molecular markers are the most popular to understand the diversity between Morus species [24,25]. Primarily, Xiang et al. [26] applied the Random amplified polymorphic DNA (RAPD) technique to illustrate variation among different species of Morus. Later Inter-Simple Sequence Repeats (ISSR) work has been reported on the genetic diversity analysis amongst the mulberry plants grown in India [24,27]. Additionally, several restrictions limit the estimation of the genetic diversity of mulberry as highlighted by earlier reports like-(a) lack of sufficient molecular markers in mulberry [28]; and (b) lack of codominant markers to construct the genetic map, which is considered to be a major constraint towards marker-assisted breeding for targeted trait improvement [25,29]. As mulberry has a highly cross-pollinated heterozygous genome, Simple Sequence Repeats (SSRs) are the most appropriate DNA markers to use for analyzing the genetic diversity of mulberries [28, [30][31][32]. SSR-based approaches appear to be one of the best DNA markers for detecting genetic diversity within a species for several reasons, especially for revealing high levels of polymorphisms [33]. In Morus spp. SSR markers developed from M. indica are used to illustrate genetic relationships and origin of cultivated mulberry [34]. However, limited studies on SSR markers-based genetic diversity and model-based population structure analysis of a wide collection of mulberry germplasm have been conducted.
In brief, in the present study, we have tried to highlight a few fundamental aspects of mulberry core set collections like (1) variation of GS (FCM-based), (2) genetic diversity (SSRbased), and (3) phenotypic variation (growth-yield and reproductive trait-based) of a wide collection of cross-pollinated, highly heterozygous tree species of genus Morus to develop a model-based population structure and identify ploidy-associated traits. The information generated in the present study may serve as a platform for future omics approaches for the improvement of mulberry traits.

Plant materials
A total of 157 accessions of the eight different Morus species including popular varieties cultivated in India were used in this study, which was obtained from the core collection maintained at the a). Central Sericultural Germplasm Resources Centre (CSGRC) (csgrc.res.in), Hosur, India; b). Central Sericultural Research and Training Institute (CSRTI), Mysore, India; and3). Karnataka State Sericulture Research and Development Institute, Karnataka, India. The common name, accession number, species name, and place of origin (S1 Table). Accepted nomenclature for the synonymous and unresolved species was considered from The Plant List (theplantlist.org) database (description available in S1 Table).

Genome size (GS) estimation using flow cytometry (FCM)
To estimate the GS, flow cytometry (FCM) analysis was performed by using the modified protocol as described by Galbraith et al. [35]. For analysis, nuclei were isolated from 40 mg of fresh young leaves (~5 mm 2 ) of mulberry along with the internal reference standard Pisum sativum (2C = 9.09 pg, diploid). The tissue sample was taken in a Petri dish comprised of 2 ml of buffer solution (Hypotonic propidium iodide, 50 μg/ml in 3 g/L trisodium citrate dihydride containing 0.05% (v/v) of non-ionic detergent equivalent (Igepal CA-630) containing 2 mg/ mL RNase A) and immediately co-chopped to homogenate using a sharp razor blade for 60 sec at 4˚C. The homogenate solution was filtered through a 37 μm nylon filter to remove large debris. Samples were incubated at 37˚C for 30 min and FCM analysis was performed using BD LSR Fortessa™ cell analyzer (BD Biosciences). Signals of PI-stained nuclei were processed with an excitation filter and absorbance filter by the flow cytometer. For each accession, three samples were examined and analyzed using BD FACS Diva 8.0.3 Software. GS (in mega base pairs, Mbp) was calculated according to the formulae given by Dolezel et al. [36] [Ploidy = Reference ploidy ×(mean position of the G1 sample peak/mean position of the G1 reference peak)] with the conversion of 1 pg equal to 980 Mbp. saturated p-dichloro benzyne (PDB) with 2 drops of 8-hydroxyquinoline (8HQ) in 1.5 ml Eppendorf. After the shoot collection, meristems were transferred to a 0˚C freezer for 5 min and it is subjected to transfer to 4˚C for 16-20h. After that, meristems were transferred to icecold 3 parts of 100% ethanol: 1 part of glacial acetic acid and incubated at RT for 1h. Then samples were transferred to 4˚C for 3 days. After that, samples were incubated with the enzymatic solution comprised of pectinase (2.5% w/v), cellulase (1%), and pectolyase (2%) for 4-6h. Then, samples were transferred to 1% orcine stain for 24h at RT, and during microscopy 2 drops of Acetoorcine stain were used to squash the sample. Chromosome count was performed using Cilika (Madeprime, India) microscope.

Molecular analysis
Eighty-two germplasm were selected based on collection site, open/cross-pollinated hybrid, and breeding information (functional hybrids, natural hybrids, cultivars, etc.) and different ploidy levels covering all available Morus spp. (82 accessions of S1 Table) for genetic diversity analysis. For DNA isolation, 1-2 younger leaves were selected and the modified cetyltrimethylammonium bromide, CTAB method was followed [37]. The quality of genomic DNA was examined by agarose (0.8%) gel electrophoresis and quantified using a Nanodrop DNA quantifier (Thermo Fisher Scientific, NanoDrop 2000 UV-Vis). Reliable and reproducible twenty polymorphic nSSR primer pairs were selected after the screening of 62 nSSR markers for molecular characterization as mentioned by Mathithumilan et al. [28]; and Pinto et al. [29] (Primers in S3 Table). The SSR genotyping was carried out in 10 μl of PCR reaction containing 50 ng of DNA, 2x PCR Amplicon master mix, and 10 picomoles of SSR primers (Eurofins Pvt. Ltd., Bengaluru) using the PCR program: 94˚C for 5 min, followed by 35 cycles of 94˚C for 45sec, specific annealing temperature (50˚C to 56˚C) for 30 sec, 72˚C for 45 sec for extension, and72˚C for 8 min for the final extension using a thermal cycler (GeneAmp 9700, Applied Biosystems, USA). The final PCR product was separated on 3% agarose gel containing 0.5μgethidium bromide in 1X TBE buffer and gel images were documented in the gel documentation system (GeneGenius, Syngene, UK). Twenty polymorphic SSR marker data were numerically coded as presence = 1, absence = 0, and missing data were coded as '-1' as suggested in the GenAlExV6.5 user manual [38]. Gel images of twenty polymorphic SSR markers of 49 representative accessions of mulberry are presented in S1 Fig along with the scored SSR data (S4 Table).

Genetic diversity parameters and polymorphism information content (PIC)
Genetic diversity parameters such as the observed number of alleles (N a ), the effective number of alleles (N e ), observed heterozygosity (H o ), expected heterozygosity (H e ), and Shannon's information index (I) for each SSR marker were computed using POPGENE v. 1.32 [39], while Polymorphism Information Content (PIC) using CERVUS 3.0.7 [40]. Power of discrimination (PD) = 1 − SPi 2 , where Pi = frequency of genotype i, and probability of identity (PI) = 1 − PD for 20 SSR markers was calculated as described by Huang et al. [41]. Whereas, Gen-AlEx V6.5 was used for species-wise diversity measures i.e. percentage of polymorphic loci (% P), N a , N e , I, h (diversity) and uh (unbiased diversity), pairwise population matrix of Nei's genetic distance and pairwise PhiPT values.

Population structure and cluster analysis of Morus using SSR markers
Mulberry germplasm individual molecular data were scored from SSRs banding patterns and STRUCTURE software [42] was used to infer the population structure of 82 mulberry accessions. An admixture model was used with the option of correlating allele frequencies between populations. Tenin dependent runs were conducted for the number of populations (K) ranging from 1 to 12 with 10 iterations for each value of K. The length of burn-in Markov Chain Monte Carlo (MCMC) replications was set to 500,000 and data were collected from over 500,000 MCMC replications in each run. We identified the optimal value of K using both the ad hoc procedure described by Pritchard et al. [42] and the method developed by Evanno et al. [43] with the help of Structure Harvester software Earl and Von [44]. Mulberry accessions were assigned to the subpopulations based on the cluster assignment probability (Q). Further cluster analysis of 82 germ plasm was determined using DARWIN software, version 6.0.21 [45]. Principle Coordinate Analysis (PCoA) and Analysis of Molecular Variance (AMOVA) was performed using the software GenAlEx V6.5.

Morphological characters
To correlate phenotypic traits with ploidies, morphologically unique 80 mulberry accessions were chosen to represent the genetic spectrum from the whole germplasm collection. The data was recorded for 9 characters viz. inter-nodal distance (cm), leaf-lamina length (cm), leaf-lamina width (cm), leaf area (sq. cm), petiole length (cm), petiole width (cm), mature inflorescence length (cm), mature fruit length (cm) and mature fruit width (cm) for the two seasons during 2020 and 2021. All the characters were recorded based on DUS test guidelines for Mulberry (plantauthority.gov.in/sites/default/files/mulberry.pdf) and FAO (fao.org/3/AD107E/ ad107e0v.htm#TopOfPage). To estimate the inter-nodal distance (InD, cm), the total length of the longest shoot was divided by the total number of nodes after 90 days of pruning. For, leaflamina length (LLL, cm) fully grown leaves from the 7 th to 9 th position in the longest shoot were selected, and measured the leaf blade length from the leaf base at the juncture of the petiole attachment to the leaf tip. Additionally, for leaf-lamina width (LLW, cm), the width of the leaf was taken from the widest point on both sides of the leaf margins. The leaf area (LA, cm 2 ) of the mulberry was measured using a leaf area meter (Biovis PSM L3000). To measure Petiole length (PL, cm), the petiole portion was separated from the base of the leaf blade and measured, and for petiole width (PW, cm), the thickness was measured using Vernier caliper from the same petioles that were used for measuring PL.
To measure the mature inflorescence length (MIL, cm), fully bloomed inflorescence of male (before anthesis), female (at the receptive stage), and bisexual catkins depending upon the sex expression of the accession were used and measure the length including the pedicel of the inflorescence in a minimum of three inflorescence/plant and three plants/accession. For the measurement of mature fruit length (MFL, cm) fruits fully ripened was collected and the length of the full fruit including the peduncle was recorded, whereas for mature fruit width (MFW, cm) thickness was recorded using Vernier calipers. Parameters like Ind, LLA, LLW, LA, PA, and PW were measured after 90 days of pruning, whereas, MIL, MFL, and MFW were measured after 60 days of pruning.

Statistical analysis
The normality test (Shapiro-Wilk test) and homogeneity of variance test (Levene's test) were carried out (the dependent variable for each group is not normally distributed and variances of the groups are not equal). Kruskal-Wallis H test followed by Dunn's post hoc tests (p < 0.05) was conducted to know the significance among the ploidy groups. All statistical analyses were carried out using IBM SPSS Statistics 23.0. The descriptive statistics (mean, maximum, minimum values, standard deviation, and standard error) were calculated for replicated data. A box plot for ploidy groups was drawn for all characters using the R 3.5.3 version (R Core Team [46]) and R Studio Team [47]. The mean values were used for Spearman rank correlation, ploidy groups PCA, and ploidy-species annotated heatmap analysis with MVappa (https://mvapp.kaust. edu.sa; [48]) and ClustVis (https://biit.cs.ut.ee/clustvis, [49]) online tools, respectively.

Intra and interspecific ploidy variation of Morus spp
The GS estimation of the core collections available in India is not yet investigated so far. Therefore, the estimation of the GS of the 157 potential germplasm encompasses different species of Morus that were collected from tropic, subtropical, and temperate areas all over the world, including functional hybrids, popular varieties cultivated in India, and are accounted for in this study (S1 Table). Results indicate that-1. GS estimation signifying that the nDNA content of studied mulberry accessions ranged from 0.72 to 2.89pg, the highest amount of 2.89pgDNA was found in M. serrata (hexaploid genome) and the least value of 0.72pg was recorded in S-30 (diploid genome) (Fig 1A; S1 Table). The nDNA content in the replication of the same accession was almost the same (0.02 to 0.09; S1 Table).
2. The mean genome content of 0.86, 1.24, and 1.79pg was recorded for diploids, triploids, and tetraploids, respectively. The average GS ranges between 0.71 to 0.98pg for diploids, 1.16 to 1.29pg for triploids, 1.45 to 2.02pg for tetraploids, and 2.88 to 2.91pg for hexaploids are substantially different ploidy levels are differentiated as per the GS estimated (Fig 1A).
3. The mulberry accession in the same species varied slightly (Fig 1B). For example, 0.73pg and 0.98pg of GS were observed in SRDC-2 and Kokuso-20, which belongs to M. alba. Similarly, accessions of M. indica GS ranged between 0.72 to 0.98pg in diploids (S1 Table). 4. Among the predicted four ploidy, accessions of tetraploids showed the highest variability in GS, ranging from 1.45 to 2.02pg (±0.03) (Fig 1A).  Table).

Predicted ploidy level indicates wild collections of
6. The maximum variability at ploidy levels was observed from M. macroura, out of seventeen studied accessions, six are tested as higher ploidy levels (Fig 1B).
7. Additionally, FCM data indicates that with increasing ploidy levels, event count consistently decreased. The less event count might be the consequence of polyploidy-mediated increasing cell size with decreasing cell number (Fig 2A-2E).

Chromosome counting and validation of ploidy
The present cytological study validated and confirmed the chromosomal numbers of 12 selected mulberry accessions comprising different ploidy groups such as diploid varietiesV1,   (Fig 2A-2E).   Table).

Genetic diversity parameters, polymorphism information content (PIC) and AMOVA
To evaluate the overall partitioning of genetic diversity among studied germplasm collections, an AMOVA was executed using the genetic distance matrix. The AMOVA revealed that 94% of molecular variance was partitioned withinspecies and only 6% among species (S6 Table). The overall PhiPT was 0.058 (p-value = 0.001) suggesting a low level of genetic differentiation among studied Morusspecies. The pairwise population PhiPTand Nei's genetic distance were analyzed to determine the genetic relationship among Morus species (S7 Table). The highest PhiPT (0.120) and Nei's genetic distance (0.085) were observed between M. macroura and Morus spp. revealing that these species are more divergent. Whereas the lowest  Table).

Population genetic structure and cluster analysis
The SSR-basedclustering analysis of different Morus spp. was carried out using multidimensional Principal Coordinates Analysis (PCoA), Unweighted Pair Group Method with Arithmetic Mean (UPGMA) based Neighbor-joining clustering, and Bayesian model-based complementary clustering approaches. The PCoA (Fig 3A) revealed that Morus species were grouped into three major genetic clusters with a high level of intermixing as complemented by UPGMA-based NJ clustering. The first three coordinates of PCoA only explained 25.68% of the total variation (PC1: 9.90%; PC2: 8.94% and PC3: 6.84%; Fig 3A).
Additionally, the UPGMA cluster analysis was used to describe the genetic relationships among the 82 Morus accessions (Fig 3C). According to the dendrogram analysis and dissimilarity coefficient measured using the Neighbor-joining method, it is depicted that the markers were successful in segregating among the mulberry accessions irrespective of species. The dendrogram grouped all accessions into three major genetic clusters-I, II, and III (See more detailed in S8 Table). Cluster I consists of 25 accessions and the majority of them were M. indica (13) and M. alba (9) conferring theirclose genetic relationships as depicted by pairwise population PhiPTand Nei's genetic distance. ClusterII consists of 33 accessions and has two sub-clusters, Cluster II-1 and II-2 comprising 19 and 14 accessions respectively. Interestingly, sub-cluster II-1 comprises M. macroura (11) and M. alba (8) (2), and M. serrata (1) and found to be the highest admixture cluster, since more accessions of which are mulberry varieties developed from two different species (Fig 3C). Further, other than cluster I, higher ploidy level accessions are distributed in the entire groups projecting that polymorphic SSRs detected in our study were not able to discriminate amongdiploids, triploids, and tetraploids.
The existence of population structure in Morus accessions was resolute using STRUCTURE analysis based on the determination of optimal K-value (Fig 4A-4C). The results indicated that Morus accessions were grouped into three clusters (Fig 4C). The cluster-I represents 35 accessions and the majority of them belong to M. indica (16) and M. alba (11), cluster-II represents 17 accessionsmajority of them belong to M. macroura (8)
Ploidy-species annotated phenotypic data was represented in hierarchical clustering with heat maps and principal component analysis (Fig 6C, 6D). Results indicate that 82 accessions were clustered into Cluster I and Cluster II, whereas, each cluster was divided into two subclusters (SCI and SCII). Primary groupings for entries were denoted by different colours (9 groups). Cluster I comprised one tetraploid, four triploids, and twenty-nine diploids. On the other hand, Cluster II have comprised of hexaploidy, one tetraploid, two triploids, and thirtysix diploids. Hence, out of six, four triploids were grouped in Cluster I (SCII) and higher MIL and MFL were observed in this cluster of genotypes. In general, the heat map demonstrated that Cluster I comprised triploids and comprised higher MIL and MFL than diploids. Although, the value of LLW, LLL, LA, PT, and PL were higher for the genotype belonging to diploids as compared to other ploidies. Additionally, PCA data (Fig 6C, 6D) indicates that the expansion of traits was restricted for diploids as compared to triploids. Data also suggests that few accessions of diploids have a higher value for specific traits. For example, accessions MI-0540 and MI-0673 were outgroup 95% confidence circles because of a higher value of internodal distance (InD, 13.50 cm) and leaf area (LA, 451 cm 2 ), respectively.

Estimation of GS and intra-specific variation
Germplasm is an enduring resource management mission and asset for civilization and to fully explore the potential of germplasm material, particularly for the complex traits associated with yield in the face of the current threats of climate change, it is necessary to reconcile research advancement and innovation of practices [50]. There are several crop germplasm characterized in reference to ploidy level variation, and genetic and phenotypic diversity to explore the potential of germplasm material as well as elucidated consequences of polyploidization on functional traits [51][52][53][54][55]. Additionally, polyploidy (or whole genome duplication, WGD) has been recognized as a source of evolutionary force and its role in species diversification has been well understood [56]. Despite its importance, GS variation across the species and ploidy-level have not been considered ingermplasm material like tree plants mulberry, therefore itremains to be explored for a better understanding of fundamental aspects like (1) expansion of polyploidization/WGD occurred in mulberry; (2) preferred polyploid advantages
In the present section, we discuss the status of available polyploids of mulberry to understand polyploidy variation across the species level and try to address the first question. To address how and to what extent WGD occurred in mulberry, 157 mulberry accessions belonging to seven different species including popular varieties cultivated in India were selected and subjected to genome content analysis using FCM. GS data indicates that it ranged from 0.72 (M. indica, diploid) to 2.89 pg (M. serrata, hexaploid). Thus, the smallest GS: highest GS content ratio is about 4.01 (2.89/0.72) was observed within the studied population. Previous genome content studies revealed the 2C value of mulberry species ranged between 0.70 to 0.73 pg for diploid, 1.04 to 1.17 pg for triploid, 1.42 to 1.63 pg for tetraploid, and 2.02 to 2.48 pg for hexaploid, and 7.26 pg for decosaploid [11]. Thus, the present results corroborate with a https://doi.org/10.1371/journal.pone.0289766.g006 previous study that a higher level of variation is present both at the species level as well as within the accessions of the same genus. However, the estimated GS of diploid M. alba differed from the previous study by Ohri and Kumar [57], it was around 1.7pg for the n = 14. Whereas, diploid M. alba was estimated to be 1.7 pg/2C measured by micro densitometry using Feulgen stained which might have some technical shortcomings [58,59].
In our study the intraspecific variations of diploid GS data indices that about 34%, 37%, and 18% variation among the genotypes belonging to M. indica ( . [11]. However, these estimates differ from previous FCM assessments, possibly because of differences in the DNA reference standard, dyeing time, and fluorochrome difference in the current study [65,66]. Propidium iodide (PI) is commonly referred to for GS estimation through FCM as used in our present study [11,65].
That GS variability does occur among the various genotypes at an intraspecific level due to the involvement of different genomes during hybridization and selection as well as naturally. Further GS variations are also observed among the genotypes collected from different ecological conditions. Mulberry is reported to be a higher adaptability tree plant with a huge variety of species and ploidy. The use of wild species in breeding provides a lot of possibilities, to incorporate the traits such as drought, frost, and disease resistance into cultivated varieties [67]. Breeding of mulberry takes about 12-16 years and to develop a variety, ploidy compatibility is an essential aspect of breeding. To date, the varieties (like V1, K2, S13, MSG2, etc.) released by the researchers are not sufficient for the future sericulture industry, and an urgent need to develop climate-resilient mulberry varieties, including diploids and triploids. Ploidy and GS can influence reproductive compatibility, fertility, and heritability of traits hence our data can be useful for understanding the breeding potential between the different species of mulberry. In this context, we have provided a new platform for future ploidy breeding by focusing on GS and diversity analysis of mulberry accessions available in India and their relationship.
Besides the FCM analysis, classical cytological work to estimate the metaphase chromosome number is essential to validate the chromosomal number, which is lacking in previous studies [10,57,59]. Attempts were made to count metaphase chromosome numbers using shoot-tips of original plants and present cytological evidence corroborates with previous studies [10, 13, 11-22, 68], which strengthening our FCM analysis.
Till 2020, chromosome configuration, genome size, meiotic association, and chromosome assembly at the genetic level were restricted in Morus. However, the controversy over the ploidy of M. alba was clarified by Jiao et al. [69] that the chromosome number (2n) of M. alba, is 28 (diploid) which provides a basis for a better understanding of the developmental and regulatory mechanism of economic target traits. Moreover, present GS estimation for diploids, triploids, tetraploids, and hexaploids, across the species level, was found to be technically foolproof to discriminate the higher ploidy level in the mulberry, because, among the different methods for GS estimating, FCM is now considered as the most precise method [70,71]. GS varies significantly across plant life; however, variation of GS in vascular plants is associated with evolutionary consequences on plant development and ecological performance [63,72]. Moreover, the present research generated GS of 157 core collections, in turn; the information can be useful for effective genetic improvement and genetic diversity investigations. Additionally, the present result and discussion clarify that Morus spp. having a higher level of GS variation, not restricted at the species level but also within the accessions of the same genus.

Genetic diversity based on SSR markers
Understanding of genetic diversity of mulberry is another important aspect of the future breeding program. Genomics-based breeding was reported to be an effective and promising approach to reaching sustainable crop improvement and is more relevant specifically for perennial crops such as mulberry [73]. Despite the significant progress achieved through conventional breeding, even though to date it is distressingly slow, mainly because of the perennial growth habit and complex inheritance pattern [73]. Additionally, the adoption of modern genomic approaches for crop improvement is severely constrained by the lack of sufficient molecular markers in mulberry [28]. Finally, the paucity of genomic information for co-dominant marker systems has been a serious obstacle to using molecular breeding for the advancement of mulberry genetics [29].
SSRs are simple tandemly repeated DNA sequences comprising units of 1 to 6 nucleotide(s) and are ubiquitous in plant genomes [74,75]. Because of their distinct qualities of co-dominant inheritance, multi-allelic nature, vast genome coverage, high abundance, and especially good reproducibility, SSRs are the most widely recognized genetic markers that are actively used in plant breeding [76]. SSR markers are extensively employed in plant research for a wide range of studies including DNA fingerprinting, high-density mapping, genome comparative mapping, population diversity study, and assisted breeding [75]. Although the genetic distances, evolution, and kinship of Morus have been studied by using molecular marker technology, the conclusions are still controversial [77,78]. In the present research, a set of stable and highly polymorphic SSR markers were employed as evidenced by greater PIC (0.556), PD (0.602), the observed heterozygosity (0.477), and a lower PI value of 0.398. Hence, markers' inheritance clearly explains their potential as genetic resources in diversification studies. Moreover, the information generated in the present research based on SSR marker-based genotyping, higher PIC, PD, and observed heterozygosity can be efficiently used for future molecular breeding programs of the mulberry species/genotypes. Recently, Jain et al. [79], reported single-nucleotide polymorphism (SNPs) ranging from 241,897 (S1) to 657,137 (M. rotundiloba) in different mulberry accessions such as K2 reference genome with other accessions namely Punjab Local, M. indica, BR-8, M. multicaulis, and M. serrata showed higher genetic variations with �0.5 million SNPs which is conformity in our finding that phylogenetic analysis through SSRs revealed diverse relationship between selected species of Morus genome. SSR analysis of the mulberry germplasm confirmed a complicated genetic background and high genetic diversity. However, SSR markers were not able to differentiate among ploidy levels hence flow cytometry method is reliable for the determination of mulberry accession at the ploidy level. But, SSRs analysis is trustworthy for an understanding of the degree of genetic variation where the breeding of germplasm and mulberry germplasm conservation gets benefited. Moreover, the present research was focused to estimate genetic diversity using simple sequence repeats (SSRs), and coming to understand the phylogenetic relationship and population structure. Insight gained herein suggested the effect of genetic variation instead of ploidy, which might be the consequence of the high level of heterozygosity imposed by natural cross-pollination.
Additionally, population structure analyses are another important aspect of the present study because it reflects the relationship between GS and marker variation in the population structure analyses. The population structure analysis revealed the existence of three genetic clusters with a high level of intermixing, a high level of within-species genetic diversity, and a weak genetic structure among the studied Morus species. The combined clustering analysis revealed that only the majority of accessions of M. indica (CusterI) and M. macroura (ClusterII) were grouped in a specific cluster, while the accessions of M. alba, M. indica, and Morus spp. were distributed throughout the three major clusters suggesting a high level of genetic admixture in these species. Such results are attributed to the outbreeding and wind pollination reproductive system of Morus species [80]. In turn, the present finding suggests genetic background of species is the account of admixture between the groups that occurred due to a high level of heterozygosity as the result of gene exchange between the species during the outbreeding reproductive system. In addition, accessions were collected from different countries and they are continuously used for mulberry breeding programmes and which resulted in a mixed population. For example, M. alba species originating from France and Indonesia are grouped separately from those originating from Japan and India. Understanding the molecular diversity and modifications in genome content triggered by WGD, not only helps in determining the evolution but also shapes the plant genome leading to adaption, response to stress, and diversification

Phenotypic trait plasticity and its relation to genome size (GS)
Polyploidy has been considered a source of evolutionary development, species expansion, as well as adaptation. Especially in plants, polyploids are most frequent events, which occurred as a consequence of WGD events and appear to be associated with environmental conditions. Hence, the present research tried to understand the phenotypic plasticity among the ploidy groups (2x, 3x, 4x, and 6x) in Morus spp. However, female floral features, notably style length, were a well-recognized attribute used to classify mulberries in terms of female accessions [83,84]. Though, recently quantitative vegetative traits were considered and shown to be beneficial for mulberry plants [59]. Moreover, in the present research, we have considered both vegetative and reproductive traits to recognize the effect of WGD on phenotypic plasticity. The correlation results indicate a significant positive correlation between GS with traits viz., mature inflorescence length (MIL, 0.45), and Mature fruit length (MFL, 0.38). Similarly, Wang et al. [85] observed a weak significant positive correlation between GS and fruit size in Chinese jujube. Whereas, the relationship between DNA content and the inflorescence characters was non-significant [86]. Additionally, multiple trait-based PCA referred to GS linked with leaf area (LA), which is the account of the contribution of MIL, MFL, and InD (positive coordinate) and LLW, LLL, PT, and PL (X + , Ycoordinate). Though, further research will require investigating how and to what extent GS is intimate to physiological and/or anatomical traits in mulberry.

Conclusions
An intriguing area of mulberry (Morus spp.) research is to understand genetic complexity as their genetic makeup is complicated by cross-pollinate behavior with a huge number of ploidy variations across multiple wild species. In the present study, we have tried to highlight the intraspecific diversity of GS, phenotypic traits, and genetic variation of a worldwide collection of 157 mulberry accessions. Present investigation revealed that core collections bear a huge genome content variation, from diploid to hexaploids an about 4.01-fold difference of GS. Additionally, the phenotypic study suggests that a significant positive correlation between GS and mature fruit length (MFL) and mature inflorescence length (MIL). Furthermore, the population structure analysis revealed the presence of three genetic clusters with a high level of admixture, which implies the effect of genetic variation instead of ploidy on trait plasticity that could be a consequence of the high level of heterozygosity imposed by natural cross-pollination. Moreover, the information generated in the present study will complement further genetic diversity studies to identify unique/duplicate accessions, and GS/ploidy information as well as ploidy-associated traits will be useful for breeding and crop improvement. Besides mulberry breeding, the present study signifies downsize of traits with upscaling of GS, for example, petiole length (PL, diploids, 4.3±1.01 cm> triploids, 4.28±0.81 cm>tetraploids 4.18±1.34 cm). Therefore, future research to understand the consequences of polyploid mulberry need to be explored because biomechanical expenses of higher ploidy are more towards the maintenance of large cell size, large genome size, nucleic acid biosynthesis, and carrying out its physiological activities.